DETAILED ACTION

1. This office action is in response to correspondence filed May 23, 2008 in
reference to application 10/781,443. Claims 1, 3-11, and 13-22 are pending in the
application and have been examined.

Response to Amendment 

2. The amendments filed May 23 have been accepted and considered in this office
action. Claims 1, 3, 11, 13, 21, and 22 have been amended and claims 2 and 12
cancelled.

Response to Arguments 

3. Applicant's arguments with respect to claims 1, 11, 21, and 22 have been
considered but are moot in view of the new ground(s) of rejection.

Claim Rejections - 35 USC § 103 

4. The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

5. Claims 1, 3-11, and 13-20 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Shriberg et al. (Can Prosody Aid the Automatic Classification of
Dialog Acts in Conversational Speech?), from hereon Shribergi, in view of Chino (US
Patent 5,761,637) and further in view of Dahlback et al. (Empirical Studies of Discourse
Representations for Natural Language Interfaces).



6. Consider claim 1, Shribergi teaches a method of determining a predictive model
for discourse functions (abstract, page 3) comprising the steps of:

determining a training corpus of speech utterances (page 7, speech data section 
discusses the training corpus); 

determining discourse functions associated with the speech utterances in the 
training corpus, the discourse functions being determined based on a theory of 
discourse analysis (Pages 8-13, Dialog Act Labeling section discusses dialog act
labeling and segmentation for the training corpus. It is inherent that in order to annotate
discourse relations, and therefore analyze discourse, it must be based on some theory
of discourse analysis);

determining prosodic features associated with the speech utterances in the 
training corpus (Pages 14-18 teach collecting prosodic features for the training corpus);
and 

determining a predictive model of discourse functions by associating the prosodic 
features determined from the speech utterances in the training corpus with the 
discourse functions determined from the speech utterances in the training corpus (page 
19, Decision Tree Classifiers section, prosodic classifiers are trained on labeled training
data),

wherein the predictive model of discourse functions is operable to predict from
prosodic features of a specific recognized speech, a likelihood that speech utterances of
the specific recognized speech reflect a specific discourse function (prosody models are
developed for use in DA classification, Abstract).

Shribergi does not teach or suggest that discourse functions are determined 
automatically. 

In the same field of discourse analysis, Chino teaches determining discourse 
functions automatically (Abstract, apparatus generates discourse structures
automatically).

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to use automatic discourse analysis as taught by Chino in order to allow 
for the removal of human interference in the training process. 

Shribergi and Chino do not specifically teach:

wherein the predictive model of discourse functions is operable to predict a 
likelihood of a first portion of a speech utterance being associated with a command 
directed at an application and a second portion of the speech utterance being 
associated with content being provided to the application. 

In the same field of discourse analysis, Dahlback teaches wherein the predictive 
model of discourse functions is operable to predict a likelihood of a first portion of a 
speech utterance being associated with a command directed at an application and a 
second portion of the speech utterance being associated with content being provided to 
the application (The discourse model is a reduced model for natural language interface
applications that includes query language command and other system input, page 292
"The Discourse Model," which would be command and content).

Therefore it would have been obvious for the discourse model to be able to 
predict command and content as suggested by Dahlback using the discourse model of 
Shribergi and Chino in order to allow for accurate analysis and prediction of
human-machine interactions.

7. Consider claim 3, Chino teaches the method of claim 2, wherein the theory of 
discourse analysis is at least one of: the Linguistic Discourse Model, the Unified 
Linguistic Discourse Model, Rhetorical Structure Theory, Discourse Structure Theory 
and Structured Discourse Representation Theory (Figure 7 shows a generated
discourse structure for the speech input, column 6, line 13 to column 7, line 28).

8. Consider claim 4, Shribergi teaches the method of claim 1, in which the
predictive models are determined based on at least one of: machine learning, rules
(page 19, Decision Tree Classifiers section, prosodic classifiers are trained on labeled
training data. Tree classifiers are a form of machine learning).

9. Consider claim 5, Shribergi teaches the method of claim 4, in which the machine
learning based predictive models are determined based on at least one of: statistics,
decision trees, Naive Bayes (page 19, Decision Tree Classifiers section).

10. Consider claim 6, Shribergi teaches the method of claim 1, in which the prosodic
features occur in at least one of a location: preceding, within and following the
associated discourse function (Page 14, Prosodic Features section paragraph 1,
example is given on how feature F0 will change in an utterance, showing that it occurs
within the discourse function).

11. Consider claim 7, Shribergi teaches the method of claim 1, in which the prosodic
features are encoded within a prosodic feature vector (Pages 14-18 teach collecting
prosodic features for the training corpus. Table 10 on page 22 shows a feature set used
for classification).

12. Consider claim 8, Shribergi teaches the method of claim 7, in which the prosodic
feature vector is a multimodal feature vector (Table 10 on page 22 shows a feature set 
used for classification.). 

13. Consider claim 9, Shribergi teaches the method of claim 1, in which the
discourse function is an intra-sentential discourse function (table 1 page 8 shows
statements, which can be intra-sentential discourse).

14. Consider claim 10, Shribergi teaches the method of claim 1, in which the
discourse function is an inter-sentential discourse function (table 1 page 8 shows
Questions, which can be inter-sentential discourse).

15. Consider claim 11, Shribergi teaches a system of determining a predictive model
for discourse functions (abstract, page 3) comprising the steps of:

an input/output circuit for retrieving a corpus of at least one speech utterance 
(page 7, speech data section discusses the training corpus, obvious it must be 
inputted.); 

determining at least one discourse function associated with speech utterances in 
the training corpus (Pages 8-13, Dialog Act Labeling section discusses dialog act
labeling and segmentation for the training corpus);

a processor (inherent in prosodic feature extraction and decision tree modeling) for:

determining prosodic features associated with the speech utterances in the 
training corpus (Pages 14-18 teach collecting prosodic features for the training corpus);
and 

determining a predictive model of discourse functions by associating the prosodic 
features determined from the speech utterances in the training corpus with the 
discourse functions determined from the speech utterances in the training corpus (page 
19, Decision Tree Classifiers section, prosodic classifiers are trained on labeled training 
data), 

wherein the predictive model of discourse functions is operable to predict from 
prosodic features of a specific recognized speech, a likelihood that speech utterances of
the specific recognized speech reflect a specific discourse function (prosody models are
developed for use in DA classification, Abstract).

Shribergi does not specifically teach a processor for determining at least one 
discourse function associated with the speech utterances. 

However, in the same field of discourse analysis, Chino teaches a processor for
determining at least one discourse function associated with the speech utterances in the
training corpus, the discourse function being determined automatically based on a
theory of discourse analysis (figure 1, column 4, line 50, shows a conversation discourse
apparatus. It is inherent that in order to annotate discourse relations, and therefore
analyze discourse, it must be based on some theory of discourse analysis).

Therefore it would be obvious to combine the machine discourse analysis as 
taught by Chino with the modeling system of Shribergi in order to eliminate the need for 
human intervention in the model training process.

Shribergi and Chino do not specifically teach:

wherein the predictive model of discourse functions is operable to predict a 
likelihood of a first portion of a speech utterance being associated with a command 
directed at an application and a second portion of the speech utterance being 
associated with content being provided to the application. 

In the same field of discourse analysis, Dahlback teaches wherein the predictive 
model of discourse functions is operable to predict a likelihood of a first portion of a 
speech utterance being associated with a command directed at an application and a 
second portion of the speech utterance being associated with content being provided to
the application (The discourse model is a reduced model for natural language interface
applications that includes query language command and other system input, page 292
"The Discourse Model," which would be command and content).

Therefore it would have been obvious for the discourse model to be able to 
predict command and content as suggested by Dahlback using the discourse model of 
Shribergi and Chino in order to allow for accurate analysis and prediction of
human-machine interactions.

16. Consider claim 13, Chino teaches the system of claim 12, wherein the theory of 
discourse analysis is at least one of: the Linguistic Discourse Model, the Unified 
Linguistic Discourse Model, Rhetorical Structure Theory, Discourse Structure Theory 
and Structured Discourse Representation Theory (Figure 7 shows a generated
discourse structure for the speech input, column 6, line 13 to column 7, line 28).

17. Consider claim 14, Shribergi teaches the system of claim 11, in which the 
predictive models are determined based on at least one of: machine learning, rules 
(page 19, Decision Tree Classifiers section, prosodic classifiers are trained on labeled 
training data. Tree classifiers are a form of machine learning).

18. Consider claim 15, Shribergi teaches the system of claim 14, in which the
machine learning based predictive models are determined based on at least one of:
statistics, decision trees, Naive Bayes (page 19, Decision Tree Classifiers section).

19. Consider claim 16, Shribergi teaches the system of claim 11, in which the
prosodic features occur in at least one of a location: preceding, within and following the
associated discourse function (Page 14, Prosodic Features section paragraph 1,
example is given on how feature F0 will change in an utterance, showing that it occurs
within the discourse function).

20. Consider claim 17, Shribergi teaches the system of claim 11, in which the
prosodic features are encoded within a prosodic feature vector (Pages 14-18 teach
collecting prosodic features for the training corpus. Table 10 on page 22 shows a
feature set used for classification).

21. Consider claim 18, Shribergi teaches the system of claim 17, in which the 
prosodic feature vector is a multimodal feature vector (Table 10 on page 22 shows a 
feature set used for classification.). 

22. Consider claim 19, Shribergi teaches the system of claim 11, in which the
discourse function is an intra-sentential discourse function (table 1 page 8 shows
statements, which can be intra-sentential discourse).

23. Consider claim 20, Shribergi teaches the system of claim 11, in which the
discourse function is an inter-sentential discourse function (table 1 page 8 shows
Questions, which can be inter-sentential discourse).

24. Consider claim 21, Shribergi teaches an apparatus operable to generate a
carrier wave encoded to transmit a control program, useable to program a computer to
determine a predictive model for discourse functions, to a device for executing the
program (abstract, page 3; generating a carrier wave would be inherent if a computer is
used, as one must be generated from the memory to execute instructions), the control
program comprising instructions for:

determining prosodic features associated with speech utterances in a training 
corpus of speech utterances (Pages 14-18 teach collecting prosodic features for the
training corpus); and 

determining discourse functions associated with the speech utterances in the 
training corpus of speech utterances, the discourse functions being determined based 
on a theory of discourse analysis (Pages 8-13, Dialog Act Labeling section discusses 
dialog act labeling and segmentation for the training corpus. It is inherent that in order to
annotate discourse relations, and therefore analyze discourse, it must be based on
some theory of discourse analysis; page 7, speech data section discusses the training
corpus);

determining a predictive model of discourse functions by associating the prosodic
features with the discourse functions (page 19, Decision Tree Classifiers section,
prosodic classifiers are trained on labeled training data),

wherein the predictive model of discourse functions is operable to predict from 
prosodic features of a specific recognized speech, a likelihood that speech utterances of 
the specific recognized speech reflect a specific discourse function (prosody models are
developed for use in DA classification, Abstract).

Shribergi does not teach or suggest that discourse functions are determined 
automatically. 

In the same field of discourse analysis, Chino teaches determining discourse 
functions automatically (Abstract, apparatus generates discourse structures
automatically).

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to use automatic discourse analysis as taught by Chino in order to allow 
for the removal of human interference in the training process. 

Shribergi and Chino do not specifically teach:

wherein the predictive model of discourse functions is operable to predict a 
likelihood of a first portion of a speech utterance being associated with a command 
directed at an application and a second portion of the speech utterance being 
associated with content being provided to the application. 

In the same field of discourse analysis, Dahlback teaches wherein the predictive 
model of discourse functions is operable to predict a likelihood of a first portion of a
speech utterance being associated with a command directed at an application and a
second portion of the speech utterance being associated with content being provided to 
the application (The discourse model is a reduced model for natural language interface 
applications that includes query language command and other system input, page 292 
"The Discourse Model," which would be command and content.). 

Therefore it would have been obvious for the discourse model to be able to 
predict command and content as suggested by Dahlback using the discourse model of 
Shribergi and Chino in order to allow for accurate analysis and prediction of
human-machine interactions.

25. Consider claim 22, Shribergi teaches a computer readable storage medium
comprising: computer readable program code embodied on the computer readable
storage medium, the computer readable program code usable to program a computer to
determine a predictive model for discourse functions (abstract, page 3; generating a
carrier wave would be inherent if a computer is used, as one must be generated from
the memory to execute instructions) comprising the steps of:

determining a training corpus of speech utterances (page 7, speech data section 
discusses the training corpus); 

determining discourse functions associated with speech utterances in the training 
corpus of speech utterances, the discourse functions being determined based on a 
theory of discourse analysis (Pages 8-13, Dialog Act Labeling section discusses dialog 
act labeling and segmentation for the training corpus. It is inherent that in order to
annotate discourse relations, and therefore analyze discourse, it must be based on
some theory of discourse analysis);

determining prosodic features associated with the speech utterances in the 
training corpus of speech utterances (Pages 14-18 teach collecting prosodic features for
the training corpus); and 

determining a predictive model of discourse functions by associating the prosodic 
features with the discourse function (page 19, Decision Tree Classifiers section, 
prosodic classifiers are trained on labeled training data), 

wherein the predictive model of discourse functions is operable to predict from 
prosodic features of a specific recognized speech, a likelihood that speech utterances of 
the specific recognized speech reflect a specific discourse function (prosody models are
developed for use in DA classification, Abstract).

Shribergi does not teach or suggest that discourse functions are determined 
automatically. 

In the same field of discourse analysis, Chino teaches determining discourse 
functions automatically (Abstract, apparatus generates discourse structures
automatically).

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to use automatic discourse analysis as taught by Chino in order to allow 
for the removal of human interference in the training process. 

Shribergi and Chino do not specifically teach:

wherein the predictive model of discourse functions is operable to predict a
likelihood of a first portion of a speech utterance being associated with a command 
directed at an application and a second portion of the speech utterance being 
associated with content being provided to the application. 

In the same field of discourse analysis, Dahlback teaches wherein the predictive 
model of discourse functions is operable to predict a likelihood of a first portion of a 
speech utterance being associated with a command directed at an application and a 
second portion of the speech utterance being associated with content being provided to 
the application (The discourse model is a reduced model for natural language interface 
applications that includes query language command and other system input, page 292 
"The Discourse Model," which would be command and content.). 

Therefore it would have been obvious for the discourse model to be able to 
predict command and content as suggested by Dahlback using the discourse model of 
Shribergi and Chino in order to allow for accurate analysis and prediction of
human-machine interactions.

Conclusion 

26. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to DOUGLAS C. GODBOLD whose telephone number is 
(571) 270-1451. The examiner can normally be reached on Monday-Thursday
7:00am-4:30pm and Friday 7:00am-3:30pm.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard, can be reached on (571) 272-7603. The fax phone number
for the organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of an application may be obtained from the
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

DCG 

/Patrick N. Edouard/ 

Supervisory Patent Examiner, Art Unit 2626 



